1 CAACCATTGC AGATCAGTGT GTGAGGGAAC TGCCATCATG AGGTCTGACA 
51 AGTCAGCTTT GGTATTTCTG CTCCTGCAGC TCTTCTGTGT TGGCTGTGGA 
101 TTCTGTGGGA AAGTCCTGGT GTGGCCCTGT GACATGAGCC ATTGGCTTAA 
151 TGTCAAGGTC ATTCTAGAAG AGCTCATAGT GAGAGGCCAT GAGGTAACAG 
201 TATTGACTCA CTCAAAGCCT TCGTTAATTG ACTACAGGAA GCCTTCTGCA 
251 TTGAAATTTG AGGTGGTCCA TATGCCACAG GACAGAACAG AAGAAAATGA 
301 AATATTTGTT GACCTAGCTC TGAATGTCTT GCCAGGCTTA TCAACCTGGC 
351 AATCAGTTAT AAAATTAAAT GATTTTTTTG TTGAAATAAG AGGAACTTTA 
401 AAAATGATGT GTGAGAGCTT TATCTACAAT CAGACGCTTA TGAAGAAGCT 
451 ACAGGAAACC AACTACGATG TAATGCTTAT AGACCCTGTG ATTCCCTGTG 
501 GAGACCTGAT GGCTGAGTTG CTTGCAGTCC CTTTTGTGCT CACACTTAGA 
551 ATTTCTGTAG GAGGCAATAT GGAGCGAAGC TGTGGGAAAC TTCCAGCTCC 
601 ACTTTCCTAT GTACCTGTGC CTATGACAGG ACTAACAGAC AGAATGACCT 
651 TTCTGGAAAG AGTAAAAAAT TCAATGCTTT CAGTTTTGTT CCACTTCTGG 
701 ATTCAGGATT ACGACTATCA TTTTTGGGAA GAGTTTTATA GTAAGGCATT 
751 AGGAAGGCCC ACTACATTAT GTGAGACTGT GGGAAAAGCT GAGATATGGC 
801 TAATACGAAC ATATTGGGAT TTTGAATTTC CTCAACCATA CCAACCTAAC 
851 TTTGAGTTTG TTGGAGGATT GCACTGTAAA CCTGCCAAAG CTTTGCCTAA 
901 GGAAATGGAA AATTTTGTCC AGAGTTCAGG GGAAGATGGT ATTGTGGTGT 
951 TTTCTCTGGG GTCACTGTTT CAAAATGTTA CAGAAGAAAA GGCTAATATC 
1001 ATTGCTTCAG CCCTTGCCCA GATCCCACAG AAGGTGTTAT GGAGGTACAA 
1051 AGGAAAAAAA CCATCCACAT TAGGAGCCAA TACTCGGCTG TATGATTGGA 
1101 TACCCCAGAA TGATCTTCTT GGTCATCCCA AAACCAAAGC TTTTATCACT 
1151 CATGGTGGAA TGAATGGGAT CTATGAAGCT ATTTACCATG GGGTCCCTAT 
1201 GGTGGGAGTT CCCATATTTG GTGATCAGCT TGATAACATA GCTCACATGA 
1251 AGGCCAAAGG AGCAGCTGTA GAAATAAACT TCAAAACTAT GACAAGCGAA 
1301 GATTTACTGA GGGCTTTGAG AACAGTCATT ACCGATTCCT CTTATAAAGA 
1351 GAATGCTATG AGATTATCAA GAATTCACCA TGATCAACCT GTAAAGCCCC 
1401 TAGATCGAGC AGTCTTCTGG ATCGAGTTTG TCATGCGCCA CAAAGGAGCC 
1451 AAGCACCTGC GATCAGCTGC CCATGACCTC ACCTGGTTCC AGCACTACTC 
1501 TATAGATGTG ATTGGGTTCC TGCTGACCTG TGTGGCAACT GCTATATTCT 
1551 TGTTCACAAA ATGTTTTTTA TTTTCCTGTC AAAAATTTAA TAAAACTAGA 
1601 AAGATAGAAA AGAGGGAATA GATCTTTCCA AATTCAAGAA AGACCTGATG 
1651 GGGTAATCCT GTTAATTCCA GCCACATAGA ATTTGGTGAA AACCTTGCTA 
1701 TTTTCATATT ATCTATTCTG TTATTTTATC TTAGCTATAT AGCCTAGAAT 
17 51 TCCATGATCA TGAGGTTGTG AGTATATCTC ATTCTTTCGT TGCATTTTCC 
1801 TAGGTGTGCT TACTCTCTTC TCTCACTTTG TGACACAAGG ACATGAATAC 
1851 ATCTAAATTT TCCTATTTCT GATATCACTG TTTCCATGAC GTCATTACTT 
1901 CTCTAACCTT AAGTGATAGG GTGACCTGCA ATATGCTGAT TCCTGGTGTT 
1951 TGCACAAACA CATGGATGTA AAGAAGTAAA AAATGTAAAA TTCACAAAAT 
2001 TCAGTAAACC AC AC AAAT C A ATGAAGCATT CTATGACATT AGCTTGTTAT 
2051 GAGTAACATA ATGATTTTTC TTTTTCAATT TAAATAAGCC CTTCTACATA 
2101 CCCAGCATTA CTGATCTCAG ACAATGAATT GCTAAAAATG ACGATAGGGC 
2151 ATT AC ACT CA GAATAGTTTG CTATATTTCC ACATACCTCA TCTAGATGTC 
2201 ATAGCCTACA TTTCTGCCAT CACTTAACTG ACATTTTTTG TGTGTTCTTG 
2251 ATGATAAATA GACAGTTCTT ATTATTGTCC TCAAATAATA AAAGAAACTG 
2301 AAATTTTCTT ACATAGAGAA AATGTCCATA AGATATTCAA GTTAAACAGA 
2351 TTATTTTGAG ATAAGTAACC ATTAGAAATA TGTGATTGTA ATTTCTGATT 
2401 TTATAAAATT TTAATTGATA GTACACTTGA TTTAAATGTC TATTCTTTAA 
24 51 AATGATGAAT ACTCATAATT CTTATCTCTA TAATCAAAAG TATAATTTAC 
2501 TGTAGAAAAA TAAAGAGATG CTTGTTCTGA AAGTAAAAAA AAAAAAAAAA 
2551 AAAACACTGT CATGCCGTTA CGTAGCGTAT CGTTGACAGC CCACTGTCAT 
2601 GCCGTTACGT AGCATATCGT TGACAGCGAC ACTGTCATGC CGTTACGTAG 
2651 CGTATCGTTG ACAGCACTGT CATGCGTTAC GAGCGTATCG TTGACAGCAC 
2701 TGTCATGCCG TTACGTAGCG TATCGTTGAC AGCAAAACAC TGTCAGCCGT 
2751 TACGTAGCG (SEQ ID NO:l) 



FEATURES : 

5'UTR: 1-37 

Start Codon: 38 

Stop Codon: 1619 

3'UTR: 1622 
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Homologous proteins: 

Top 10 BLAST Hits 

CRAi 147000022596013 /altid=g 
CRA| 1000682322899 /altid=gi I 
CRA| 335001098690982 /altid^g 
CRAI 18000005226060 /altid-gi 
CRA| 18000005155438 /altid=gi 
CRA| 18000005147363 /altid=gi 
CRA| 18000004953169 /altid=gi 
CRA| 18000005148770 /altid=gi 
CRA| 18000004903910 /altid=gi 
CRA| 18000005164 4 61 /altid=gi 
CRAI 1000682327923 /altid-gi I 
CRA| 18000005219476 /altid=gi 



i | 10438148 /def =dbj I BAB1517 9 . 1 | (AK. 
5802604 /def=gb|AAD51732.1| (AF1752. 
i|11436851 /def=ref |XP_003547.1| UD. 
15803213 /def-ref |NP_006789.1| UDP . 
14507821 /def=ref |NP_001068.1| UDP . 
16175083 /def=sp| P06133 |UDB4_HUMAN . 
(484383 /def=pir| I JN0619 glucuronos . 
13153832 /def =gb I AAC95002 . 1 | {AF064 . 
14507819 /def=ref |NP_001067.1| UDP . 
13426332 /def =gb I AAC32272 . 1 1 (AF081. 
588124 6 /def=gb|AAD55093.1|AF180322. 
18134780 /def=sp|Q9XT55|UDBJ_MACFA . 
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0.0 
0.0 
0.0 
0.0 
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BLAST dbEST hits: 

gi 1 10971169 /dataset=dbest /taxon=96.. 
gi 1 11068678 /dataset-dbest /taxon=96. . 
gi I 679005 /dataset=dbest /taxon=9606 / 
gi I 3173232 /dataset=dbest /taxon=9606 
gi I 3134358 /dataset=dbest /taxon=9606 
gi|10298020 /dataset-dbest /taxon=96. . 
gi|11974507 /dataset=dbest /taxon=96. . 
gi 1 11973717 /dataset=dbest /taxon==96 . 
gi 1 12673874 /dataset-dbest /taxon=96.. 
gi|10887798 /dataset-dbest /taxon=96.. 
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200 
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125 



e-103 
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2e-52 
2e-52 
2e-52 
8e-49 
le-47 
2e-40 
le-29 
4e-26 
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& 



EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

Expression information from BLAST dbEST hits: 

gi 1 10971169 Kidney-hypernephroma 

gi 1 11068678 HepG2 cell line 

gi 1679005 Liver 

gi I 3173232 Kidney 

gi I 3134358 Kidney 

gi 1 10298020 Hepatocellular carcinoma 

gi 1 11974507 Normal pigmental retinal epithelium 

gi 1 11973717 Normal pigmental retinal epithelium 

gi 1 12673874 Kidney hypernephroma 

gi 1 10887798 Kidney 



Expression information from PCR-based tissue screening panels: 
Human fetal liver 
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1 MRSDKSALVF LLLQLFCVGC 

51 HEVTVLTHSK PSLIDYRKPS 

101 LSTWQSVIKL NDFFVEIRGT 

151 VIPCGDLMAE LLAVPFVLTL 

201 DRMTFLERVK NSMLSVLFHF 

251 AEIWLIRTYW DFEFPQPYQP 

301 GIWFSLGSL FQNVTEEKAN 

351 LYDWIPQNDL LGHPKTKAFI 

401 IAHMKAKGAA VEINFKTMTS 

451 PVKPLDRAVF WIEFVMRHKG 

501 TAIFLFTKCF LFSCQKFNKT 



GFCGKVLVWP CDMSHWLNVK VILEELIVRG 
ALKFEVVHMP QDRTEENEIF VDLALNVLPG 
LKMMCESFIY NQTLMKKLQE TNYDVMLIDP 
RISVGGNMER SCGKLPAPLS YVPVPMTGLT 
WIQDYDYHFW EEFYSKALGR PTTLCETVGK 
NFE FVGGLHC KPAKALPKEM ENFVQSSGED 
IIASALAQIP QKVLWRYKGK KPSTLGANTR 
THGGMNGIYE AIYHGVPMVG VPIFGDQLDN 
EDLLRALRTV ITDSSYKENA MRLSRIHHDQ 
AKHLRSAAHD LTWFQHYSID VIGFLLTCVA 
RKIEKRE (SEQ ID NO: 2) 



FEATURES : 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION 
N-glycosylation site 



Number of matches: 3 

1 131-134 NQTL 

2 313-316 NVTE 

3 518-521 NKTR 



[2] PDOC00004 PS00004 CAMP_PHOSPHO_SITE 

cAMP- and cGMP-dependent protein kinase phosphorylation site 



Number of matches: 2 

1 67-70 RKPS 

2 340-343 KKPS 



[3] PDOC00005 PS00005 PKC_PHOSPHO_SITE 
Protein kinase C phosphorylation site 



Number of matches: 6 

1 3-5 SDK 

2 120-122 TLK 

3 169-171 TLR 

4 200-202 TDR 

5 435-437 SYK 

6 520-522 TRK 



[4] PDOC00006 PS00006 CK2_PHOSPHO_SITE 
Casein kinase II phosphorylation site 



Number of matches: 9 



1 


62 


-65 


SLID 


2 


141- 


144 


TNYD 


3 


204- 


207 


TFLE 


4 


243- 


246 


TLCE 


5 


258- 


261 


TYWD 


6 


296- 


299 


SSGE 


7 


297- 


300 


SGED 


8 


419- 


422 


TSED 


9 


435- 


438 


SYKE 



[5] PDOC00007 PS00007 TYR_PHOSPHO_SITE 
Tyrosine kinase phosphorylation site 

Number of matches: 2 

1 122-130 KMMCESFIY 

2 136-14 3 KKLQETNY 
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[6] PDOC00008 PS00008 MYRISTYL 
N-myristoylation site 

Number of matches: 4 

1 19-24 GCGFCG 

2 276-281 GGLHCK 

3 373-378 GGMNGI 

4 377-382 GIYEAI 

[7] PDOC00009 PS00009 AMI DAT ION 
Amidation site 

338-341 KGKK 

[8] PDOC00359 PS00375 UDPGT 
UDP-glycosyl transferases signature 

354-397 WIPQNDLLGHPKTKAFITHGGMNGIYEAIYHGVPMVGVPIFGDQ 

[9] PDOC00804 PS01047 HMA 
Heavy-metal-associated domain 

12-41 LLQL FCVGCG FCGKVLVW PC DM SHWLN VKV 



Membrane spanning structure and domains : 



ix 


Begin 


End 


Score 


Certainty 


1 


5 


25 


1. 


,802 
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2 
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0, 


.765 


Putative 


3 
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201 


0. 


, 779 


Putative 


4 


377 


397 


0, 


,735 


Putative 


5 


491 


511 


1. 


, 931 


Certain 
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BLAST Alignment to Top Hit: 

>CRA| 147000022596013 /altid=gi | 10438148 /def =db j I BAB1517 9 . 1 | 
{AK025587) unnamed protein product [Homo sapiens] 
/org^Homo sapiens /taxon=9606 /dataset=nraa /length=44 9 
Length = 44 9 

Score = 931 bits (2381), Expect = 0.0 

Identities = 448/449 (99%), Positives = 448/449 (99%) 



Query: 


79 


MPQDRTEENEIFVDLALNVLPGLSTWQSVIKLNDFFVEIRGTLKMMCESFIYNQTLMKKL 


138 






MPQDRTEENEIFVDLALNVLPGLSTWQSVIKLNDFFVEIRGTLKMMCESFIYNQTLMKKL 




Sbjct : 


1 


MPQDRTEENEIFVDLALNVLPGLSTWQSVIKLNDFFVEIRGTLKMMCESFIYNQTLMKKL 


60 


Query: 


139 


QETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGNMERSCGKLPAPLSYVPVPMTG 


198 






QETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGNMERSCGKLPAPLSYVPVPMTG 




Sbjct: 


6L 


QETNYDVMLIDPVIPCGDLMAELLAVPFVLTLRISVGGNMERSCGKLPAPLSYVPVPMTG 


120 


Query: 


199 


LTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKALGRPTTLCETVGKAEIWLIRT 


258 






LTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKALGRPTTLCETVGKAEIWLIRT 




Sbjct : 


121 


LTDRMTFLERVKNSMLSVLFHFWIQDYDYHFWEEFYSKALGRPTTLCETVGKAEIWLIRT 


180 


Query: 


259 


YWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSSGEDGIVVFSLGSLFQNVTEEK 


318 






YWDFEFPQPYQPNFE FVGGLHCKPAKALPKEMENFVQSSGEDGIVVFSLGSLFQNVTEEK 




Sbjct: 


181 


YWDFEFPQPYQPNFE FVGGLHCKPAKALPKEMENFVQSSGEDGIWFSLGSLFQNVTEEK 


240 


Query: 


319 


AN I IASALAQI PQKVLWRYKGKKPSTLGANTRLYDWI PQNDLLGHPKTKAFITHGGMNGI 


378 






ANI IASALAQI PQKVLWRYKGKKPSTLGANTRLYDWI PQNDLLGHPKTKAFITHGGMNGI 




Sbjct: 


241 


ANI IASALAQI PQKVLWRYKGKKPSTLGANTRLYDWI PQNDLLGHPKTKAFITHGGMNGI 


300 


Query: 


379 


YEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYKE 


438 






YEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYKE 




Sbjct: 


301 


YEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKTMTSEDLLRALRTVITDSSYKE 


360 


Query: 


439 


NAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSAAHDLTWFQHYSIDVIGFLLTC 


498 






NAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSAAHDLTWFQHYS I DVIGFLL C 




Sbjct: 


361 


NAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSAAHDLTWFQHYS I DVIGFLLAC 


420 


Query: 


499 


VATAIFLFTKCFLFSCQKFNKTRKIEKRE 527 (residues 79-527 of SEQ ID NO: 2) 






VATAIFLFTKCFLFSCQKFNKTRKIEKRE 




Sbjct: 


421 


VATAIFLFTKCFLFSCQKFNKTRKIEKRE 449 (SEQ ID NO: 4) 





>CRA| 1000682322899 /altid=gi I 5802 604 /def =gb | AAD51732 . 1 1 (AF175221) 
UDP glucuronosyltransf erase UGT2A3 [Cavia porcellus] 
/org=Cavia porcellus /taxon=10141 /dataset=nraa 
/length=530 
Length = 530 

Score = 795 bits (2030), Expect = 0.0 

Identities = 377/530 (71%), Positives = 435/530 (81%), Gaps = 3/530 (0%) 

Query: 1 MRSDKSALVFLLLQLFCVGCGFCGKVLVWPCDMSHWLNVKVILEELIVRGHEVTVLTHSK 60 

M K A LLL L C G GFCGKVLVWPC+MSHWLN+K +LEEL+ RGHEVTVLT S 
Sbjct : 1 MAPGKLASAVLLLLLCCAGSGFCGKVLVWPCEMSHWLNLKTLLEELVKRGHEVTVLTLSN 60 

Query: 61 PSLIDYRKPSALKFEVVHMPQDRTEENEI FVDLALNVLPGLSTWQSVIKLNDFFVEI 117 

IDY + A FEV-f + P D+ I F++LA+NV+P + WQS L FFV+I 

Sbjct: 61 NLFIDYNRHPAFNFEVIPVPTDKNMSENILNEFIELAVNVMPTMPLWQSGKLLQQFFVQI 120 

Query: 118 RGTLKMMCES FI YNQTLMKKLQETNYDVMLI DPVI PCGDLMAELLAVPFVLTLRI SVGGN 177 

L + C + +YNQ+LMKKL+++ YDV++ DPVIPCG+L+AE+L VPFV L+ S+G 
Sbjct: 121 TEDLGLNCRNTVYNQSLMKKLRDSKYDVLVTDPVIPCGELVAEMLGVPFVNMLKFSMGHT 180 
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Query: 178 MERSCGKLPAPLSYVPVPMTGLTDRMT FLERVKNSMLSVLFH FWIQDYDYHFWEEFYSKA 237 

+E+ CG+LPAP SYVPVP+ GLT RMTF+ERVKN + SVLF FWIQ YDY FW++FYS+A 
Sbjct: 181 IEKYCGQLPAPPSYVPVPLGGLTTRMTFMERVKNMVFSVLFDFWIQQYDYKFWDQFYSEA 240 



Query: 


238 


LGRPTTLCETVGKAEIWLIRTYWDFEFPQPYQPNFEFVGGLHCKPAKALPKEMENFVQSS 


297 






LGRPTTLCE +GKAEIWLIRTYWDFEFP+PY PNFEFVGGLHCKPAK LPKEME FVQSS 




Sbjct: 


241 


LGRPTTLCEIMGKAEIWLIRTYWDFEFPRPYLPNFEFVGGLHCKPAKPLPKEMEEFVQSS 


300 


Query : 


298 


GEDGIVVFSLGSLFQNVTEEKANIIASALAQIPQKVLWRYKGKKPSTLGANTRLYDWIPQ 


357 






GEDG+VVFSLGS+ +N+TEEKAN+IASALAQIPQKVLWRYKGKKP+TLG NTRL+DWIPQ 




Sbjct: 


301 


GEDGVVVFSLGSMVKNLTEEKANLIASALAQIPQKVLWRYKGKKPATLGPNTRLFDWIPQ 


360 


Query: 


358 


NDLLGHPKTKAFITHGGMNGIYEAIYHGVPMVGVPIFGDQLDNIAHMKAKGAAVEINFKT 


417 






NDLLGHPKTKAFITHGG NGIYEAI YHGVPMVG+PI F DQ DN+A MKAKGAAVE+N T 




Sbjct: 


361 


NDLLGHPKTKAFITHGGSNGIYEAIYHGVPMVGMPIFSDQPDNLAGMKAKGAAVEVNMNT 


420 


Query: 


418 


MTSEDLLRALRTVITDSSYKENAMRLSRIHHDQPVKPLDRAVFWIEFVMRHKGAKHLRSA 


477 






MTS DLL ALRTVI D +YKENAM+LSRIHHDQPVKPLDRA FW+EFVM HKGAKHLR A 




Sbjct: 


421 


MTSADLLGALRTVINDPTYKENAMKLSRIHHDQPVKPLDRAAFWVE FVMHHKGAKHLRVA 


480 


Query: 


478 


AHDLTWFQHYSIDVIGFLLTCVATAIFLFTKCFLFSCQKFNKTRKIEKRE 527 (residues 1- 


527 of 


SEQ 


ID NO:2) 








AHDL+WFQ++S+DVIGFLL CVA+AI L TKC LFS Q F K K K+E 




Sbjct: 


481 


AHDLSWFQYHSLDVIGFLLACVASAILLVTKCCLFSFQNFIKIGKRIKKE 530 (SEQ ID NO: 5) 



Hmmer search results (Pfam) : 

Model Description Score E-value N 

PF00201 UDP-glucoronosyl and UDP-glucosyl transferas 962.0 1.5e-285 1 

Parsed for domains: 

Model Domain seq-f seq-t hmm-f hmm-t score E-value 

PF00201 1/1 24 525 1 507 [] 962.0 1.5e-285 
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1 


TTCTAGAGGG 


TTGGAACAAC 




51 


CCTTCAGTAC 


ATGTTAAACT 




101 


TATTAATTTT 


TAACTTCTGT 




151 


GTAACTACAA 


AACCATTGCA 




201 


GGTCTGACAA 


GTCAGCTTTG 




251 


GGCTGTGGAT 


TCTGTGGGAA 




301 


TTGGCTTAAT 


GTCAAGGTCA 




351 


AGGTAACAGT 


ATTGACTCAC 




401 


CCTTCTGCAT 


TGAAATTTGA 




451 


AGAAAATGAA 


ATATTTGTTG 




501 


CAACCTGGCA 


ATCAGTTATA 




551 


GGAACTTTAA 


AAATGATGTG 




601 


GAAGAAGCTA 


CAGGAAACCA 




651 


TTCCCCGTGG 


AGACCTGATG 




701 


ACACTTAGAA 


CTTCTCTAAG 




751 


TCCAGCTCCA 


CTTTCCTATG 




801 


GAATGACCTT 


TCTGGAAAGA 




851 


CACTTCTGGA 


TTCAGGATTA 




901 


TAAGGCATTA 


GGTAAGACAC 




951 


AGAAATATTT 


TTAAAAATTG 




1001 


GTCAATGAGT 


TTTTTTAGAA 




1051 


GTATTTGTTC 


TAAGCGTAAG 




1101 


CTTAAAAATG 


ATCATCAATA 




1151 


AATAGCGCAA 


ATCAACATCA 


•=!;=■ 


1201 


TGAATGTTAT 


AGAATTTCTA 




1251 


TCTCATCATG 


TATTTAAGAA 




1301 


TTTCAAGAGT 


CATTAAATTG 


Oil 


1351 


GTGATTACTT 


CACAAAAACT 




1401 


TACGGTCTTG 


TGGGGCATAA 


1451 


ACCTATTTGA 


CTTATTTTTA 




1501 


TGCTTTGCAA 


TTTTCATATA 


: 


1551 


TGTTTTTATC 


CAAATATGTT 


ItJ 


1601 


ACAATTATAT 


CCGACACTAA 


!£ 


1651 


TATGCATAGA 


CTGTTTCACA 




1701 


TGTCCTTAGC 


CTTAGACATG 


FT 5 


1751 


AATAATTTTC 


CATTCAGTAA 


sfj 


1801 


CTTTATGGAT 


CTAGGTAAAC 




1851 


TC AG AT ATT A 


ATTTAAAGGG 


Q!t 


1901 


TTGGTAAAAT 


AGGATTATAG 




1951 


AATAAAACAA 


AAGTGTTATC 


•stir 


2001 


AAAAGTGGGT 


TCTGGCCATG 




2051 


AATCATGGAG 


CAGCTTGATA 




2101 


TTATGTAGCT 


CCAACTTACT 




2151 


TTATCTATTG 


TGAGTCTAAA 




2201 


AAAGGGATAT 


CAGTCAATTT 




2251 


ACGCACAGGT 


GTATTTCATT 




2301 


AAAATTGCAA 


TTTACAAATT 




2351 


ATCAATTTGG 


CGTAATATTT 




2401 


CACATATTGG 


CATTTATCAT 




2451 


TTACGGTGAT 


CTGGGATCAG 




2501 


TGGGGGCACC 


TTGATGTGTT 




2551 


TGTTGTGCAT 


GTACTAACTG 




2601 


TTCTTAGGCC 


TCCCTATTCC 




2651 


TTCTAAATGT 


TCCAGTGAAA 




2701 


AAAAATATAA 


AGGAATAAGT 




2751 


GCTTGACCCT 


ACAAAATCCT 




2801 


TGGGGCTTGA 


TGGGAAGTGT 




2851 


ACATTAATGC 


TCTCCCTGGG 




2901 


GGTCCCAGGA 


GAGATAATTA 




2951 


TCTCTTGCAT 


ATCTCTCATT 




3001 


ATCTTCTTCA 


ATGAGTGGAG 




3051 


TTGGTGCCAT 


GCCTCTTGTA 




3101 


CCTTTTTCTT 


TTACAAATTA 




TTTTCCCTGA TACATTGCAT TTTTTTGATA 
GGCAACCACC AGTGAACTTT ACTCTTAAAA 
GCTTATATTG TCATTTCAAC TCCTTGCTTA 
GATCAGTGTG TGAGGGAACT GCCATCATGA 
GTATTTCTGC TCCTGCAGCT CTTCTGTGTT 
AGTCCTGGTG TGGCCCTGTG ACATGAGCCA 
TTCTAGAAGA GCTCATAGTG AGAGGCCATG 
TCAAAGCCTT CGTTAATTGA CTACAGGAAG 
GGTGGTCCAT ATGCCACAGG ACAGAACAGA 
ACCTAGCTCT GAATGTCTTG CCAGGCTTAT 
AAATTAAATG ATTTTTTTGT TGAAATAAGA 
TGAGAGCTTT ATCTACAATC AGACACTTAT 
ACTACGATGT AACGCTTATA GACCCTGTGA 
GCTGAGTTGC TTCCAGTCCC TTTTGTGCTC 
AGGCAATATG GAGCGAAGCT GTGGGAAACT 
TACCTGTGCC TATGACAGGA CTAACAGACA 
GTAAAAAATT CAATGCTTTC AGTTTTGTTC 
CGACTATCAT TTTTGGGAAG AGTTTTATAG 
TTTTGTTTTA TTTTTAATTT AGTTATCAAA 
TCATACATTG TCTATGACAT AT AT AT GC AG 
AATGTTGTAG CTGTTTTTCA TAAAGAAAGT 
ATAACCTACT TTCTTAATAC CAGTAATATA 
ACTAAGAGAT TATATTTTGT ATTTCCTCCA 
CATATTTTTG AGAATCACTG ATTGTTAGTC 
TTGAAATAAA ATGCTAATCA TTATTTTCTC 
AATCTTCAGA AGGTCTTCTT TGAATTAATT 
AACATTTTCT AGAATTCTTT AATTTCTTAG 
TGAAAAAATA TTATAAAAAG TTAAAAAACT 
GATAGTAGAA TTTTTACTTT ACTGATATAC 
TTTCTTTGCT T TACT GAT AA AAAGTTGTTT 
GTTGTGATCA GAGCTGGTCA ATGCAAGACA 
TGAGAATTAT GTAGAAACAT GAAAAAAGGT 
AATATTGTTT AATGTATTCC AACGAATTCT 
GAACTAATAT TCAGAGGATC CCAGTTCAAA 
ATTTGAATTT ACATATATTG ATTTGCTTTA 
GCTGTGCCTA GCTGCAGATA GCCTACCAGG 
AATACAAATC TCTTGGCCTC AAGTCTACAT 
GTACAGCTAT ATAGAGGTCA CTGGCAAATT 
TAAAAGCCCC CTGACAAGAT TGAAATTTAA 
AAAGGGGTGA AAGAGCATTT TCCAATAAAC 
CAT TC AG AAA TTCCCCAACA ATTCTTTAAA 
TATAAGAAAT TCATTTAATA ACTATATTTA 
AAATTATTGA T TAT TAT AT A TTTTATAGAA 
TCAAGAGTAT ATATTCAAAC AACTATAGGA 
CAATTCAAGG ATTTATTTCC ATAAGTGCTT 
TTATTATACA TTGCTTTATT GTCCTTCACA 
AAAGGTTTTT GAAAACCTTG AATCAAGCTA 
CCAACAACAA GTGTGTACTT TTGACTCTAT 
GCTTTTTCAA ATTTTTCATT GTTATATCTG 
TGTTCCTTGA TGGTTACACG TTTATTAGCT 
ACAATATAAG ACAGCAAACT TAATTATAAA 
CTCCGCTGAT TCGTTTCCCC ATCCCACTTC 
CTGAGACACA GTAATATAAC ATACAATGAC 
AGAAAAGTAG CAGGTCTCTC AATTTAAACC 
TTAATGAGTA CTATAGTTTA GATATGGTTT 
GTGTTGAAAT TTGATCACCA ATATTGGAGG 
TAGGGTCATG AGGGTAGATT CCTTATGAGT 
GAAATGGGTG AGTTCGTTCT CACTCTATTA 
TTAAAAAGAG CCAGGAACAT CCACCTTCTT 
ATCTGATCCC TGCACTTGCT GGCTCCCAAC 
GAAACCAGAG GTCTTCACCA GACACAGATG 
TACCCTGAAG AATTGTGAGC CAAATAAAAA 
GACAGCCTCA GTTATTCCTT TGTAGCAACA 
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3151 AAAAAAGCCT GGGACAGGCC AAAAACTACA CCATTGCACC AAACAGTTAA 

3201 ACAAGATGTG AGTGCAAAGG AAAAGTTTTT GGAGGAAATT AAAAGTGCTA 

3251 CTCCAGTGTA CATACAAATG ATAAGAACAA ATAACCATTA TCAGTGCTGA 

3301 TATGGAGAAA ATTTTAGTTG TCTGGAGAGA AAATCAAATT AGCTAGCCAG 

3351 CTGCAGTGAT TCATATCTGT AATCCCAGTA ACTTGGGAGG CTCAGGTGGG 

34 01 AGAACGGCTT GAGCCCAGAA GTTTGAAGTC CAAGGCTGCA GTGAGCTATG 

3451 ATTGCTCCAC TGCACTCCAA CCTAGGTGAT AGAGCAAAAC CACTACCAAA 

3501 AAAAAAAAAA AAAAAAAGAA GAAAAAGAAA AGAAAAAAAA TTAAACCAAC 

3551 CACAACATCA CCTTAGGTTT TGGCATTAGC TAAAAACTAA TACATAGTAA 

3601 AGCGTTAACT ATTCAATTGC ATGAAACCTC AGAGAGGAGA GGAAGATGCA 

3651 GAAAAAAAGA CTGAAGCTAG TAGAGGTTGA CTAATGAGGT TTACAGGAAT 

3701 AAACTGCCTA CATGATGCAA AAGTTCAATG TGAAGCAATA GGAAGTCATG 

3751 CAGAAGACTT AGCTAATATA CTCAGTAAAT GTGGCTACAG TAAACAAATG 

3801 ATTTTCAATG TAGACCTAAC AGCCTTCTGT TGGAAGAAGA TGCCATTTAA 

3851 AACTTTCATA GCTAGAGAAG AGAAGTCAAT GCTTGTCTCT GAAGCTACAA 

3901 AAAACAGGCT GAATCTCTTG TAGTGGCTAA TGCAGCTGAT GACAAAGGTA 

3951 AAGCCAATGC CCATTTACTT TTTGTAATAA TTATAGAGGA CTCTTAATAA 

4 001 TTATGTTAAA TCTACTTTGC CTGTGTTATA TCAATGGAAC AACAAAGCCT 

4051 GGATGATATC ACATTGGTAT ATGACATGGC TTATTGAATA TTTTAAGCAC 

4101 ACTGTTGAGA CCTATTGCTC AAAAAAGAGG ATTCCTTTCA AAATATTGCT 

4151 GCTCATTGAC AATTCACATG GTCAACAAAG GGCTCTGATT AAGATGTACA 

4201 GATATTAATG TTTGCCTGCT TGCTATTATT ACATCCATCT TACATGCCAT 

4 251 GG AT CAT AT A GCCTTGACTT TCAAGTCTTA TGTAAGAAAT ATATTTTGTA 

4301 AGGCTATAGC TCTTACTAAT GGGGAAAGTA TATTGAAAAC CTTTTCAAAA 

4 351 GGATTTTTCA TTCTAGATTC CATTAAGAAC ATTCATGGTT CATGAGAGGA 

4 401 AGTCAAGATA TTAACATTAA CAAGAGTTTG GAAAAAATTT GATTCTAACT 

4 451 CTCCTGGATG ATTTTGAGGG ATTGAAGACA TCATGTGAAG AATTAACTGG 

4501 GGATGGGGTG GTCATGAAAA AATAAATAGA ATTATAAGTG GGCCTGAAGG 

4551 TTTGTCTAAA TTGCTATAAT ATCATGATAA AACTAAAACC TGTAAAACCG 

4 601 GTGAGGAGGT GCTTTTTAAA CAGTTACTTT TTATAGATGA ACACAGAAAT 

4 651 TGGTTTTGTG AGTTGGAATC TTCTCCGAGT GAAAATGCTA TGAACATTGT 

4701 TGAAATGGCT ACAAATGACT TAGAATATTA CACAAAATTA GTAGATAAGG 

4751 CAGCATCAAG GTTTGAGAGA ATGGACTCAA ATTTTGAAAG AAATTCTACT 

4801 ATGGGTAAAC TGCTGTGAAA CATCATCATA TGCTACAGAG AAATCTTTCA 

4851 TGAAAAGATG AGTCAATTCA TGCAACAATC TTTGTTGTCT AATTTTAAAA 

4901 ATTGTCCAGC TGCCCTGATC AATCAACAGT AATCAGCACT GAGGCAAGAC 

4951 CCTACACCAG AAAAAATAAA AATAAAAAAC CTCACTTGCT GAAGACTCAG 

5001 CTTATTATTA GCACTTTTTA GCCATACTTT TAACTAAGGT ATGTGCATTC 

5051 CTTTTTAAAC GTGATGATAT TGCACAGCTA ATAGCCTACA AGGTATGGTT 

5101 AACATAACTT TTATATGTCC TGGGACCCAA ATTTGTGTGA ATCACTTTAT 

5151 TGACATATTC CTTTTATTGA GATGAACTGC AACTTATCTT GCAATATCTC 

5201 CAAGATATGT GTGTATGGCA TTTCAAATAA GATGTGAAAT TATTTTATTA 

5251 GTATAAAAAG CAAATTTAAT TTTCTTTCCT TTGATCATCT TTATCCTTGT 

5301 TACTGTGTAT TTATCCTTTA AACATTGAAT GACTCCAATT GTTTAAAACT 

5351 GAGTCTTTCT TAAATGAGTC CTAATATCAT AGTAATTAAA ATCACCTACA 

5401 AGTTGGTAAT GCAGGCAGCA TGTGAGGCAC AGAAAACAAC AAATTTATAA 

5451 GACATAAATG CATTTGCTTG GAAGCTGAGA GAAGGCTCTA TTCTAATTTC 

5501 TGATAACTTC AAACTGAGTA TCTTCAGTAA AATTTATTCA CTATCAAATT 

5551 CAAGGCGTTT GGATTTATGA CCTAGGAAAA AACTTCAAAC ATTAAAATGT 

5601 GATGACCTTA AAAAGAGGCT CTCCACACTA TGGTGTATAA CACCACCAAC 

5651 TTTGATTAGA ATTTTAAAGA GAAACAAATT CTCTTATGGA GTTTATCTTT 

5701 TTATCACTTG CAAAATATGT TTTTGTAAAG AGATACTAAT TACTTAGTTA 

5751 TTTGTAGTTA GCCATTCTTC TGATTAAAAA CCTAAAATTA AATCTTGAAA 

5801 ATGTGTTTTC CTTCAAAACA CATCATTTGA GAGAAACACT AAAGTAAGTG 

5851 TATGATTATC ATAGCATGTA CATAGGTGCT TCACAACCCA AAAAGAATAT 

5901 TGTCATGGGT AAGAATCAGT AAAGGAATTT CTCCTAATAA AACAGTAGCC 

5951 TATTAATTAA AGTAATGATA TGCAATACAG CAAGTTAAAG GGAACTGATC 

6001 CTGGTGGGAT TATTGAAAGA TATACCCTTG ACTATAGATT AGAAAATACA 

6051 GAGATGTTAT TTAGTGAAGA TATTGTGGTA CTCATTTATC ATCTGCAATT 

6101 CACTTGCAGA GGAAAAAATG AGTAATAAAT TCATTTGCAT TTTGGATTTG 

6151 TGTCTTTAAG TTGTGAAAAT ACACTTAAAT ATAACCATCT GTCCTTTGCT 

6201 CCTTCCTTCC TTCCTTCTTT CCTTCCTTCC TTCCTTCTTT CCATCCTTCC 

6251 CTCCCTCCAT CCTTCCTTCC TTCCTTTCTT CCTTCCCTCC TTTCTTTCTC 
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TCTTTTTTCC TTTCTTTCTT 
CTTCCATTTG ACGTCTAAAA 
TTTTTTCTTA ATAGCTTACT 
ATTGTATGCT AATTCATTCT 
CATTATGTGA GACTGTGGGA 
TGGGATTTTG AATTTCCTCA 
AGGATTGCAC TGTAAACCTG 
ATTAAGGAAT ATTATGTACT 
TTACAACCTT AGTTATAGAA 
TCCATATTTA TTTTCCATGC 
TTTTCGGTAT TATATTGAGA 
AGATTTTAAA ATACTAGTAT 
TAGGGAATTG GAGCCATGAA 
CACAATCAAA TTGGGCTTAT 
AATAAGCATA TTTTCATGAG 
GAGTGAGAGA AAATAAGTGA 
CAATAAATGG AAAGTTTAAG 
ATAAGTGCTT AGAAGTCATC 
TTGTGACTAT CCACTTTGAG 
AAGCCATTTG AGAAAAACAA 
TATCGATAGT TATCAGGACA 
CTGCAGGTTT GTTTTGGAGT 
TCATCAGAGG AAAAAATTTA 
TGAGAAGTAA AAGTGAGTAA 
AGTATGTCTT GAGTTCACTG 
ACCTGACAAG GTTCTAGGAT 
TTACGGGATA CTTTTTATTC 
TCTATACCAA GATTGTTCTT 
ACTGTGTCTC TTGTATGACC 
ACATTATAGG AAACCCTGAC 
GTGAGAAAAC ATGGGAGAAA 
GTCTTCTTTG ACCATTTCTA 
CACTAGGAAG AAGAAAATTC 
GAGACCAAGA AAGAGAGTGA 
TCCAGGGTGT TACCTAGGTG 
TTTAATGCAA GCAGGCACAA 
TAGTCACTTT GGCATATCTG 
GTAGGCTGTA TCTAGTTGTC 
AAGTAAAAAT CCTGACTGAA 
TTTTAAACGG TGTCAGTGTT 
CAACTTATAT TTTAAATGCA 
ACCACAATAG CTATGGTAAC 
AAAAGAAAAC CCTCATTTCC 
AAGGAAGGTT TCAGAGCCAG 
AGTAAGATGT GTATTGATGA 
ACTTTACTTA CCAGGAAAGT 
TGGCATCTGT ATCTAAAATA 
TCATTCTCTA GGTAATCTCC 
CCAGTTTTAC ACACACACAC 
AAATAATCAT AGAAGACATA 
AATTTATTAG GTAAAAGCTT 
GATATATTTT AGGGCTGTCT 
TTTGACATGT AATAATTGCA 
ATACATTTAA ATGTTGTGTA 
CTAAATATTT ATCATTTCTT 
TGGCTATCTT GAAATATGTA 
AACTTAATAT AATAACAGAA 
ATTGATCCAC CATTTCTCAT 
TAACCACAAT TCTACTCTCT 
ACATATAAGG GATACCATGC 
TATGCCCTGG AGGTTCATTC 
TTTTTTTCCA ATATATATTT 
ATGTATATAT ATATATATAT 



TTTTCTTCTT TATTATTTCA 
GCCATGTTGT TCTAGAGGAC 
GAAAAATTAG TGATACAATT 
GTTATTTCTT TTATTGAGGA 
AAAGCTGAGA TATGGCTAAT 
ACCATACCAA CCTAACTTTG 
CCAAAGCTTT GCCTAAGGTA 
TTATGACATG ACTTGTTTTC 
GGATGATGTT GAATGTCGTC 
CACAGGGGCT CTTATAGGTG 
AAGTAGGCAG AAGAATTTCA 
TACAATAGTT TGGATAATAA 
GATCACTAAA AAGAATGCTC 
GAACAAGGAT ATTTGTCATG 
ACATATTGGA TATATTCCAC 
TGAAGGAAGA CAAAGAATAA 
TGTTTAATGA TAGTGATGAC 
TTGTTTGTGA TTTATATGAT 
CTCGTGAGAA TGTTAGGTGA 
GGTTTCAACC TCTGTGGACA 
AAGTAGAGCT CATAGAAATA 
GAAAATAAAA TTGTATACTA 
TAGTTCAAGG AATGTTGAAA 
TAGTTGTTAT AGTTTTTTAA 
TCCCAAAAGT GGCTATTAGC 
ATTTAGTCAT GGATGTTCAT 
TGATGAACAG CCTAATGCCT 
ATAGGGAACT TGTTTACACT 
TATGTCTTCT TTATCCCTAC 
AAGGCCAGAT GTTATATTTG 
CTTAACCAAA CACATAAAAT 
GAGAAAAGAG TTCAGCATCC 
TCTGGGAAAA GCACATTCAA 
GGGATCTATG TGCCAAAATG 
GGTTTCCAAT GGGGAACTGT 
AGTCCATGGA GGCATTCTGA 
CACAGAATCT GATCAGTGAT 
CTATAGGGTG GTTACCAGGA 
CACATTGAGG AAATGGAAGG 
GACTAAGACC TGCTTCTGGT 
TAGCCAGACA ACATAAAATT 
AATACTGGGT TTACCTATTA 
CATTTTATGG AAATATAATC 
TAGGATTTCC AGAAAAATTA 
ATATAATTTT ATTTATTAAT 
TATACCAGAA AACCAAGCTC 
GAAATACAGA AGGAGAGCTG 
CATGCCATTC TACCCTTTAT 
AAACACACAC ACACAAACAC 
TTTTTAAAAA AGTTAGATCC 
TTGTGCTGAT AATTTTACAA 
TACACTAAAT ATTTATTTTT 
CATGTTTAAG AGAAATGCTG 
ATAATTACAT CAAGATAATA 
TGTGGTGATA ACATTCAAAA 
ATACATTACT ATTAACTATA 
CATATTCTTC CAAATTTAAA 
TGCCCTCCCT ACTATCTCTT 
AATTATATTA TGAATGCATT 
TATCTCTGCC TGGATTATTT 
ATGTTTCTAC AAATGACAGG 
AATGAAATGG ATATATATAA 
CTCCAGTGGA ATGCTATTGA 
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9451 GCTATAAAAA AGTTAATATA 

9501 GGAATGGATA TATATATATA 

9551 AACACACGCA ATATACATAT 

9601 AGAGAGAGAG AGAGATATTT 

9651 GGACATATAT ATATGTATAT 

•9701 ATTAATGGAT GTTTAGGTTG 

9751 ATGTAAACAT AGAAGGACAG 

9801 TATACACCCA GTAATGGAAA 

9851 TTTTTGAGGA ACTACCATAC 

9901 TTTCCCTCAA CAGTTTATAA 

9951 ACTTGTTATT TTTGCCTTTT 

10001 TCTCATTGTA GTTTTGATTT 

10051 ATTTTGTAAT TATACTTCTT 

10101 CTATTCAGGT CTTTTGCTTA 

10151 TTGACTGATG TGACTTCTAT 

10201 TTCATAGTTT GCAAATATTT 

10251 TGTTTGCTTC ATTTTCTCTG 

10301 TATCTACTTT TCCTTTTGTT 

10351 ATCCTTGCCT AGACCAATGT 

10401 GTAGTTGTAT AATGTTTGGC 

10451 TTACTTTGTA TATGGTGAGG 

10501 GGATATGCAG TTTTCCTAGC 

10551 ATTATGTGTT CTTGGCACCT 

10601 TCCATTTGTG TTGTTATAGA 

f-% 10651 CAAATAGAAT TATTTGAATG 

%l 10701 CACTGACTTC TGCTTGGCCT 

10751 TGGTAGAAGG C AAAC AT GAG 

^ 10801 AGGAAACCAT AAAGAGGGGA 

0|| 10851 CTAGCTCTTC TGGGAACTAA 

£ 10901 ATTAATCTAT TCATGAGGGA 

j^g 10951 AGATTCTACC CCCAATATTG 

s« 11001 GCTCAAATAT CCATACTATG 

J" 11051 GATCTCTATT CTATATAGTA 

W 11101 CCATACTGTT TTGGTGATGA 

3 11151 ATATATTATA TATATGTATA 

fp 11201 GTATATTATA TATATATAAT 

11251 TTAAGTTCAA GGGTACATGT 

J ,; f 11301 ACATGTGCCA TTTTGGTTTG 

H 2 11351 TATTTCTCCT AATGCTATCC 

fl:| 11401 AGGTGTGTGA TGTTCCCCGC 

p 11451 CCTACCTAAA AGTGAGAACA 

11501 GTTTGCTGAG AATGACTGTT 

L?t:= 11551 ATGAACTCAT CCTTATTTAT 

11601 CTACGTTTTC TTAATCCAGT 

11651 AAGTCTTTGC TATTGTGAAT 

11701 TCTTTATAGC AACATGATTT 

11751 GATAACTGGG TCAAATGGTA 

11801 ACACTGTCTT CCACAATGGT 

11851 AAAAACGTTC CTATTTCCCC 

11901 CTTTTTAATG ATGGCCATTC 

11951 TTTTTGTTTG CATTTCTCTG 

12001 TGTCTTTTGG CTGCATAAAT 

12051 CTTTGCCCAC TTTTCAATGG 

12101 TAAGTTCATT GTAGATTCTG 

12151 GCAAAAATTT TCTCCCATTC 

12201 TTCTTTTGCT GTGCAGAAGC 

12251 TTTCGGCTTT TGTTGCCATT 

12301 CCCAGGCCTA AGTCCTCAGT 
12351 TATGGTTTCA GGTCTAACAT 
12401 TTGTATAAGA TGTAAGAAGG 

12451 CGTGTTTTCC CAACACCATT 
12501 TGATTTTGTC ATATTTGTCA 
12551 ATCACTGAGG CCTCTTTTCT 




TAATAGAAAT AAAGCTTATA TATATCTAAT 
ATGGAATAGA AATATATATC TATACATATA 
CCATTTCATT G CAT AT AT AT ATATATAGAG 
TCAAATGTGT GTATATATAT CCAATGGAAT 
TTTTTCCATA TTTTCTTTAT GTATTTCTTC 
ATTCATCCCT TGGGTATATG AATAATGTTG 
ATATCTCTAT GACTTCTTAG TTTATTTAAA 
TGCTGTATAA TATGGTAGTT CTATTTTCAT 
CGTTTTCCTT ACTAATTGTA CTAATTTGCA 
AAGATCTTCT TTCTCTGCAT ACTTTCTAGC 
GATAATAGCC ATAACAGGGG TGATGTGATA 
GCATTTCCCT GATGATTAGT GATTTTGAGC 
AGTCACTGAT AGTCTTCTTT TGAGAAGTGT 
TTTTTTAATC AAATTAGTAA TTTATTTTTA 
GTATATTTGA GATAGTAACT TATTGTCAGA 
TTCATGTTGT GAATTGTCTC TTCACCCTGT 
CACAAGCTCA ATGCTTTGAT ATAACCCATT 
GGCTGTGCTT CTGAAGTCCT ATCCAAAAAA 
CACAAATCAT TCCTCCTACA GTTTCTTCTA 
CTTATATTTA ACTTTGTAAT TCATTTTTAC 
GATAGAGGTC TAGTTTCATT TTCTGCATGT 
ACCATTTAGT GAAGAGGTTG CCTTTTTTCT 
TTGTCAAAAG TCAGTTAGCT GCTATATTCC 
GGAACACATG AGACTAGCAA ATTTATATAT 
ATAGTTCTGC ATACTGTACA AGAAGGACAG 
CTGGTAAGGT TCTCAAGATG CTTCCACTTG 
CTGGTATATG CAAAGGTCTC AT G AC AAG AG 
TGTGAGGGAG TGCCAGGTTT TGTAAAACAA 
TAGAGTAAAA ATTCGCCTCC CAGGCAGGGG 
TCTGCTTCCA TGACAAAGGC ACATTCTGTT 
GGGATCAAAT TTTAACATGA AGTGTGGAGG 
GCAGCAGTAA ATGCATAAAT TTATTTTGTG 
TTGGTGTATG TATCTGTTTT CATGCCACTG 
TATCTATGCT ATATATGTGT GTGTGTATAT 
TATGTGTATA T TAT AT AT AT GTATATATGT 
ACTTTAAGTT T TAT AT AT AT ATAAAATACT 
GCAGGATGTG CAGGTCAGTT ACATAGGTAT 
CTGCATGCAT CAACTCATCA TTACATTAGG 
CTCCACCAGC CACCCAACCC CCAACAGGCC 
CCTGTGTCCA TGTGTTCTCA' TTGTTCACTT 
TGCAGTGTTT GATTTTCTAT CCTTGTGATA 
TTCAGCTTCA TCCATGTCCC TCAAAAGGAC 
GGCTGCATAG TATTCCATGG TGTATATGTG 
CTATCACTGT TGGACATTTG GGTTGGTTCC 
AGTGCTACAA TAACCATATG TGTGCATGTG 
ACTATCCTTT GTGTACATAC CCAGTAATGG 
TTTCTAGTTC TAGATCCTTG AGGAATCCCC 
TGAACTAATT TACATTCCCA CCAACAGTGT 
ACATCCTCTC CAGTATCTGT TGTTTCCTGA 
TAACTCACAT GAGATGGTAT CTCATTGTGG 
ATGACCAGTG ATGATGAGCA TTTTTTCATG 
GTCTTCTTTT GACAAGTGTC TGTTCATATC 
AGTTGTTTGT TTTTTTCCTG TAAATTTGTT 
GATATTAGCC CTTTGTCAGA TGGGTAGATT 
TGTAGGTTGC CTGTTCACCC TGATGGTAGT 
TCTTTAGCTT AATTAGATCC CATTTGTCAA 
GCTTTTGGTG TTTTAGTCAT GAAACCCTTG 
GGTATAGCCT AGGTTTTCTT CTAGGATTTT 
TTAAGTCTTT AATCCATCTT AAATTAATTT 
GATCCGTTTC AACTTTCTAC ATATGGCTAG 
TATTAAATAG GGAATCCTTT CTCCATTTCT 
AACATCACAT GGTTAGAGAT GTGTAGTGTT 
GACTCCATTG ATCTATATAT CTGTTTTGAT 
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12 601 ACCAATACCA TGTTGTTTTC GTTACTGCAA CCTTGTAATG CAATTTGACA 

12 651 TTCAGGACCA TGATGCCTCC AGTCTCTTTT TTTTTTTCTA AATAATTTTT 

12701 TTGTCAATGT AAGCTCATTT TCGCTTCTTT CTGATCCATA AAGTATTTTT 

12751 TTCCCATTTT GTGGAGAACG CCGCNNNNNN NNNNNNNNNN NNNNNNNNNN 

12801 NNNNNNNNNN NNNNNNNNNN NNNNNNNNNN NNNNGGCACA CCTCGTGCGC 

12851 AT AT AT AT AT ATATATATAT ATATACCTCT ATATATATAT ACATACATAC 

12901 ATACATACAC ACCTCCTTGT CTGGTGTGGG ATCAGGGTAA TGCTAGCCTC 

12951 ACAAGATGAT ACTGAAGTGT TTTTGCCTTT TTGACTTTTT GATGGTTTGG 

13001 AAGAGTGAGA AAAAGTGTTA TTAATTATTC TTTAAATTTT GTTGAATTTC 

13051 ATAGTGAAGA CCTTAGCTCA CTGGCTTTTT TAATGAGAAC TTTATTACTG 

13101 ATTTAAACTT CTTCTTCATT ATTTATTTCT GCCTTGTTTT TATTTCTTCA 

13151 TAATCCAGTC CTATTTTATG TGTCCACTAA ATTGTTTATT TTCCTAGAAT 

13201 TTTTCCATTT ATTGGCATAT GCATGTCCAT AGAAGCCTTT TATAGTCCTT 

13251 TTCATTTCTA GTGTCATTTT TTTCCTTTTT TTTAAGAATC CTTAAGATTT 

13301 TAGAGATGAA ATGTCACTTT GTTACGCATA CTGGAGTGCG GTGACATTAT 

13351 TATAGCTCAC TGAAACCCAA ACTCCTGAGT TTAAGCAATC CTTCTACCTC 

134 01 AAAATTCCAA AATTCCTGAG TAGCTGAGAC AGGCATACAC CATCAAGACT 

13451 GGCTAATTTA TTTCAAATTT TGTAGAGATG GGTTCTTACT AAGCTATTCT 

13501 CAATCTTTGG GCTTCAAGTG ATTCTTCAGC CTCTGTCTCT GAAAATGCTG 

13551 GGTTTATAGA TATGAGCCTC TATGCCTGAT TTGCTTTGTC TCTTTGTAAT 

13601 CTCCCATTTT ATTTGTGTCT TTTCTGGTTT GTTTCATTTT GTTATGTTTT 

13651 CAGTTACCTT GCTAAAGCTT TGTCGATTTT ATCTCTTCAA ACAACTAACT 

13701 CAATATTTTG CTGATTTTCC ATATAGTATT TTATTTCTAT TTCATTTATT 

13751 TCTGCTCTAA TCTTTGTTAA ATATCTTGTT TTCCTAATAA TTTTGAGTTT 

13801 CCTTGTTCTT GTTTTCTAAT TCCTTGCGAT GTTATCATAA ATTGTTTATT 

13851 TGATATCTTT CTACTTTTTT GATGTGTGTG TTCGTTGTTG TAGACTTTCC 

13901 TCTTTATTAT TCTGATTTCT TCCTCAATTC TCTAATATTA TGATTGCATT 

13951 ATTTTCCAAG TTTCTTTTGT TTTTTTATTT ATAGTTTATG TGATTCCTGA 

14001 ACTTGTCAAA GAGATTATTG TGAATTTGAT GTCGGATATT TAAGCATTTT 

14 051 CAAAACTTTT GGTGCATTAT TGAAATTTTA TTGGTTTATT TTAGAGATGT 

14101 CATACTTCCC AGTTTTTTTT TAACAATACT TGCTCTTTAT ATTGATGTCT 

14151 ACATATTTAA AAAGATAACC ACCTGATTCA GCTTTTTAAG GTGATATGCA 

14201 GTGGTGTTAA GTGTGTACTG CTTAATATCA GAGCTGAATC ACTGCCCTGA 

14251 GGATTCTTTC TGTTCTGAGG AGAGCTTGTA GTTAATAGCA GAACCTAAAT 

14301 AGTGCAGTAG AGCTAAATCT CTTCCATGCT GTTGTTTTCC TGTCTGGGGA 

14 351 AGACTTATCA TGACCATGAA AACATAATGC TGTGCCAGAA CTTAAACCCA 

14 4 01 AACCTGTAGT AATTTCTGAG TTGAGGAAGG CTTAAGAAAT AACTGGAACT 

14 4 51 TAGTTACTAA CCTGATAGTT GTTTCTGAGT CAGAGAAATG CTCTGCATGA 

14 501 TCACCTGGGA TATTTGTAAA ATCTAACCAA AGATTCTAGC CTTCCCCTTG 

14 551 GATTGTGCCT CCTGTACTAC TGTAGTGCTG GCTAGGTCCT CATCAGTGAA 

14 601 TTCCCTGCTG ATAGGACCAC AAAGCATCTG CCAAGATCTG TTTGCCATTT 

14 651 GCTGTGATTA GTGCTTCTGC TCTTTGCTTC CAATTCAACT CAGGTGGTTC 

147 01 AGCCCTTCTG ACACTCCTAA TACCTCCTGT GGGATGGAAC ATAGAAGGCT 

14751 TCTCACAATG ATTCACACAC TGATATGGAG ATTGAATGTC CAGTTGCAAC 

14 801 TATTTTCTTC CACCTGTGTA ATTGCAGGTA CAGGGAAGTT TTCTGTGACT 

14 851 GATGCTATTT TGGTTTGGAG AATGGGGTGA TGTGGCACAA TGATCTTTCT 

14 901 TCTTTCTGGT CATGGATTTT TTAATTTCCA TGAACCCATA AGATTTTTCA 

14 951 CTTTTCTTCT GAGCTCTGGT GCTTTCAGAG TGGTATTTTT ATATTCGAAT 

15001 AGTTGCTAGT TGTACTTTTA AAAGCGATTG ATGCTGGAGG TCTTCTATTC 

15051 CACCATCTCG CTGATGTCAG TCCTCAAATA ATAATTTTAT ATTTTAGCAA 

15101 ATTATTTTGG TTTTAGGATT TTGTGTCTAC GTGACACAGA CATGAAAAGA 

15151 GATGTACTCA TTACTGAAAC TTTTTGCATA CTGTTTTGGT TGTGCGCCTT 

15201 TTCTAGTATG AATGATTACA TATTTAAGCC ACATGTTTTA TACATAGACT 

15251 GTCCTTTAAA GAG AC TAG AT AGTTCTGTGT GTCAGCATAT AGGGACAGAA 

15301 TATAACTACA CATTAATAAT TTCTCAAGTA TTTATTTTAG AAGTGTAAGT 

15351 AACCTTTATT TTAATTTTTG TTATATTATG CCTCTGTAAT GCAGATAAAT 

15401 TTTTATCTTC AGGAAATGGA AAATTTTGTC CAGAGTTCAG GGGAAGATGG 

15451 TATTGTGGTG TTTTCTCTGG GGTCACTGTT TCAAAATGTT ACAGAAGAAA 

15501 AGGCTAATAT CATTGCTTCA GCCCTTGCCC AGATCCCACA GAAGGTCAGT 

15551 AAAACCTCCA ATCCTGATAA GCAGCTATTC ACATAATGAA ACAGTATGGT 

15601 TTTATTTGGG TCTTGAATCT CATTTTCCAC TTAGCATAAC AGGTACCAAA 

15651 ATTTGCAAAA CATTATAGTA GTGTACATGG GCATAACTGA TCATTTGCCT 

157 01 ACTGAGTCTT GCTGTTACTG GAAACAACTT TCTTGATTGT CATTTGTTTA 
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15751 


TAATAAAATA 


GATATAATAA 




15801 


AAATCTAAAA 


GCGTGTGCCA 




15851 


TATTTCAAAG 


GACCAGAAAA 




15901 


ATCAATCTCA 


AGAATATCTT 




15951 


AGGGTAGTTT 


TGCTTAGAAA 




16001 


ATAATTAGAA 


GTTTCAAAAG 




16051 


ACAATTATTT 


AAGTAATGTA 




16101 


TTATTATTAC 


TGTAATTCTA 




16151 


AAACTATCCA 


AAAGATATTT 




16201 


TCAATAAAGG 


CAAGCATACT 




16251 


TTATGAAATT 


ATCTGTTAAG 




16301 


TTCAAATGTA 


TAAAATTTGT 




16351 


AAAAAAGGTA 


CAATCTCAAA 




16401 


GTAATTTTCT 


AGCTCATTCT 




16451 


AAGAGGCCAG 


GAAGGAAATA 




16501 


ACACACATAA 


TTGAAAGCAA 




16551 


TGGTAGAATG 


AGATTGGCTC 




16601 


CTTATAAAAT 


TTTTAATTCT 




16651 


TATTTTTTTA 


AATATTCACT 




16701 


GTAATAAAAT 


AATAATAGCA 




16751 


TAAAAGATCT 


TTGTAAAACG 




16801 


AGCTCTACAC 


ACCCCTGTCT 




16851 


AATTTATAAC 


TATATCTACA 




16901 


TCTGTGAAAT 


ACATGTGTTT 


D 


16951 


AAAAAACCAT 


CCACATTAGG 


. 


17001 


NNNNNNNNNN 


NNNNNNNNNN 


S S 


17051 


AAGACCCAAT 


CCCAAAGAAA 


D:I 


17101 


TTCTCTAGCT 


CATTCGTCTT 


?>= 


17151 


GGTCAGGAAG 


GAAATAGAGA 


s 


17201 


CACAAAATTG 


AAAGCAAATT 




17251 


AGAACCAGAA 


TAGTTCCATG 


I? 's 


17301 


TAAAATATTT 


TATTCTTATA 


17351 


TTTTAATGAT 


TC ACT GAT AT 


17401 


AAATAATAAT 


AGTATACTTA 


;s 


17451 


ACTTTTGTAA 


AAAGTCCTAC 




17501 


ACATACCCCT 


GTCTCAACCA 


re 


17551 


TAACTATATC 


TACATCCTTA 




17601 


AAATACATGT 


GTTTCTTCCA 




17651 


ACCATCCACA 


TTAGGAACCA 


?J;: 


17701 


ATGATCTTCT 


TGGTAGGTCT 


17751 


GAAAAAATGA 


ATAAATGTTA 




17801 


ATTATAAAAC 


TTTATTTTAC 




17851 


ATTGACTAAA 


ATTGTATAGA 




17901 


ACAGGAAATT 


CAGAAGGAAT 




17951 


TCTTTAGTAG 


CAATGCTCCA 




18001 


GCTTCAGTGA 


GTTATCTCGT 




18051 


AAAAGTTTAG 


AATAAAATAT 




18101 


GGTCATCCCA 


AAACCAAAGC 




18151 


CTATGAAGCT 


ATTTACCATG 




18201 


GTGATCAGCT 


TGATAACATA 




18251 


GAAATAAACT 


TCAAAACTAT 




18301 


AACAGTCATT 


ACCGATTCCT 




18351 


TAACATTGAC 


TATGTTATAC 




18401 


GGTAGACATG 


TTGAGGGATT 




18451 


TTGTTACTGG 


AATAGTTGTG 




18501 


CTTCATGGAA 


ATATTAGGTT 




18551 


TTCACATCTT 


AATTTTACCC 




18601 


CTCCACTATA 


TCTGTTTAAT 




18651 


CCAGAATCAA 


TCTTTTACTG 




18701 


ACGGTTTATC 


TACGTGTCTT 




18751 


TCTATGTGTG 


TTTTTCCCTT 




18801 


CAAGAATTCA 


CCATGATCAA 




18851 


TGGATCGAGT 


TTGTCATGCG 




ATAAAGCTCT ACCTTATATT TTAGGATTTG 
ATGATTCCAA AAAAAAATTC TGACATCTAT 
AGGAAAACTG ATATAAAAAA AAAAAGAAGA 
CTCATATTTG TGTGTATAAA AACTGTATTC 
TAAAAGCTCA GATTAATGTA GTCTTTCTAA 
TAAAATGTCA ATTACAATTA TAGTATAGTA 
ATTATTTATG ATACTCCACT AATTTTAACT 
GAATTTCACA CTTTAGATAG TGCTATATAT 
CATTTTATAT TTAGCTAAAA TACTTCAAAC 
AATTAGGAAT TTGAAATATT GTAATTTCAA 
TAGTTTGAAA CATCTATGCC GTTCTTTGTT 
ATAGGTGTCC AACAAAGAAA AATTGTGTAA 
GAAAATTTAT CATTGAACAG TGGAACATAA 
TCTTCAATAA AACAATTAAA TATAAGAAGA 
GAGAAGAAAA GACACCCGAT TATCCAAAAG 
ATTTTTATCT GCAGGGAACT GTAAATTTGA 
CATGAGTTAA AATGACACAC AGATCAGGTA 
TATATAAAAA TAGATTAGCC ACTGCTGAAT 
GGTATTCTCA TTCTCAAATA TTTTTAATTG 
TACCTAATAG GCAACTGGTA CACATTATTT 
TCCTACTATA TCTTTCAGTC TTTACGCGGT 
CAACCATCAC CTGAAGTACA ATGAGTTTAT 
TCCTTAGAAT GCTAATATCC TGTGGTTCAC 
CTTCCGTAGG TGTTATGGAG GTACAAAGGA 
AGCCAATACT CGGNNNNNNN NNNNNNNNNN 
NNNNNNNNNN NNNNNNNNNN NNNAAAAAAA 
ATTTATCATA GAACAATGGA ACATAGGTAA 
CAATAAAACA AATAAATATA AGAAGACAAA 
AGAAAAGATA ACCGATTATC CAAAATCACA 
TTATCTGTGG GGAACTGTAA ATTTGATGGT 
ATTTGAAATG ACACAGAGAT CATGTACTTA 
AGAAAATTGA GTAGCCAGTG CTGAATTACT 
TCTCATACTC AGATATTTTA ATTGATATTA 
ATAGTCAACT GGTACACATT ATTTGAAAGG 
TATGTCTTTT ACTGTTTACA CAGTACCTCT 
ACACCTGAAG TACAATGAGT TTATAATTTA 
GAGTGCTAAT ATCCTGTGGT TCAATCTGTG 
TAGGTGTTAT AGAGATACAA TGGAAAAAAA 
ATACTCGGCT GTATGATGGG ATACCCCAGA 
ATGAGAAAGT AAAAATATGA ACTAGACGAG 
AACAGCAAGC AAATTCAGCA AAGATCTAAA 
TTACTCTTTT GAAGCAGATA TAATTAAAGG 
TTCACACTTT CTATTGTTAA GGTGAGAGTG 
TAATGCCTAT TTTTCTGGAG ATAGAAATGA 
TGTGCTCACC TTCTAAAGAA AGTGCTGTAC 
AATTCCCATC TGTAGTTTTT AAATAATTTT 
CTCACCATTT CTCATCCAAT TTACATACTA 
TTTTATCACT CATGGTGGAA TGAATGGGAT 
GGGTCCCTAT GGTGGGAGTT CCCATATTTG 
GCTCACATGA AGGCCAAAGG AGCAGCTGTA 
GACAAGCGAA GATTTACTGA GGGCTTTGAG 
CGTAAGTACT ACTGCTTGTA CAGACTGATC 
ATTATACCAG AAAATGTTAA ATATCATCCT 
TTACTCCACA ATATTGAGTC ATTCATCACC 
GAAATTGTAG TTCATAGAGT GTCAAACTTT 
TAAGTTAACA ACTGGCTTAC TAAGCTTTTA 
CATTTTGTTA AGAATATACT CTTTCAGTCT 
ACTATGTAAC CAACAATATT CATGTCACAA 
AACATGTTCT TGGCTTGCAT AACATATACT 
TTATGAAAAC AAAACTACAA CTTTCTAAGT 
CCAGTTATAA AGAGAATGCT AT GAG AT TAT 
CCTGTAAAGC CCCTAGATCG AGCAGTCTTC 
CCACAAAGGA GCCAAGCACC TGCGATCAGC 
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18901 TGCCCATGAC CTCACCTGGT TCCAGCACTA CTCTATAGAT GTGATTGGGT 
18951 TCCTGCTGAC CTGTGTGGCA ACTGCTATAT TCTTGTTCAC AAAATGTTTT 
19001 TTATTTTCCT GTCAAAAATT TAATAAAACT AGAAAGATAG AAAAGAGGGA 
19051 ATAGATCTTT CCAAATTCAA GAAAGACCTG ATGGGGTAAT CCTGTTAATT 
19101 CCAGCCACAT AGAATTTGGT GAAAACCTTG CTATTTTCAT ATTATCTATT 
19151 CTGTTATTTT ATCTTAGCTA TATAGCCTAG AATTCCACGA TCATGAGGTT 
19201 GTGAGTATAT CTCATTCTTT CGTTGTATTT TCCTAGGTGT CTTTACTCTC 
19251 TTCTCTCACT TTGTGACACA AGGACATGAA TACATCTAAA TTTTCCTATT 
19301 TCTGATATGA CTGTTTTGAT GATGTCATTA CTTCTATAAC CTTAAGTGAT 
19351 AGGGTGACAT GCAATATGAT TATTCCTGGT GTGCGCCCAA AC AC AT GG AT 
19401 ATAAAGAGGT AAAAAACTTA AAATTCACAA AATTCAGTAA AC C AC AC AAA 
19451 TCAGGTAAGT GTTCTATGAG ATTAGCTGGC TATGAGAAAC ATAATGATGT 
19501 TTCTTTTTCA ATTTAAATAA GCCCTTCTAC ATAGCCAGCA TCAGTGATCT 
19551 CAGAAAATAA ATTGCTAATA AT GAT G AC AT GGCATTATGC TTAGAAAAGT 
19601 TTGCTGTATT TCCATAGACC TCATCTAGAT GTCATGGCCT ACATTTCTGC 
19651 CATC ACT CAA CCAATACTTT TTTCTGTTTT CTTGATGATA AAAAGACCTT 
19701 TCTCATGATT GCCATCAAAT AACAAAAGAA ACTATTTTTT TTCTCACATA 
19751 GAGAACATGT CAGTAAGATA TTCAAGGTGA ACAGATATTT TTGGGATTAG 
19801 TAACTATTTG AAATATGTGG TGATAATTAC TGAGTTTATA AAATTTATTT 
19851 GATAGTACAC TTAAAGAAGA TTTATATGTT TATTCTTTAA AAATGATGAA 
19901 TACTCATAAT TCTTATCTCT ATAATCAAAA GTATAATTTA CTGTAGAAAA 
19951 ATAAAGAGAT GCTTGTTCTG AAAGTAAGAT CAGTGAACTG CTTTTCAGTC 
20001 TCAATCTTTG AGAATTGTAA ATT CATC AAA TAATTGCTTA CATAGTAAAA 
f ; | 20051 ATTTAAGGTA TTAGAAAACC TGCATAACAA ATAGTATTAT ATATTAAATA 

^ 20101 TTTTGATATG TAAAGCTCTA CACAAAGCTA AATATAGTGT AATAATGTTT 

Jjf 1 - 20151 ACACTAGTAA GCAAATATGT TAATCTTCTC ATTTTTTTAC TGTCATATAA 

J!? S 20201 TCTTAGTGAT ATGCCTATTA ATAGTTTTAA ATAAATAAAT TGGCTTATCT 

Oil 20251 GGCTTTTTGA AAATTTTGAA ATTCTTACAG ATGTTGATTA GGTATATCTA 

J!! 20301 CAAATTAATT TCAATTTTAA AATGATGATA TAAAAATAAA TATAAGTATT 

20351 TTTCTTGTGT ATGTATACAA TAAATATAAA TAAAATTGTT TACTGTTTTG 
20401 AAAGTTTCTT AAGTTTTTAC ACTGATATGT TTTTTGACTT TTACAATATT 
J? 20451 ATTATAATCT AGGAAAAGCT GATTATATCT GTTTTAAGCC TCATCTTTTC 

P 20501 TCTGTAATTA AACACAGTAA TTTATTAACA TGCTGTGACA GGTGGGAAGC 

a 20551 CATTTCTGGA GTTGAGCCTG CTGACACTCT GGAGCTTTTT AGGTTGGACG 

20601 TTCATTGTAT GTGGGACTCT CTGCCTCTCG ATAGCTGTTG CTCATAAGAC 
20651 TCTCCTTCAT CAATCTGGCA TTGAATTTTG CGATCAGTTG CAATCAGAAT 
20701 CCAATTGGCC TTGCCGTTTT AGTATGTTCT ATCTTAACCA GCAATTTCTA 
^ 20751 ACCAGGAGCC TGCCCAGGTT TGTTCTGTCT TCCCTGTAAG AAGCTCCCAG 

CR! 20801 CATAAATATT CTAAATTTTA CACTACTAAT CTATTAACCA ACCTTTGGAC 

p 20851 CATGTTCACT TTAGGTTGAG CATAGTGTGA TGAGATGCAA ATTAAATTAC 

20901 AATCCTATAG GTGTGTGTTA TAAATTTTAA AGTGTATAAA TTAAATAACA 

20951 CATTCTAAGT ATCCAACAAA GGTCAAAAAA ATGATATAAA GTCACCAAAC (SEQ ID NO: 3) 



m 



FEATURES : 




Start: 


197 


Exon: 


197-911 


Intron : 


912-6498 


Exon: 


6499-6647 


Intron: 


6648-15412 


Exon: 


15413-15544 


Intron: 


15545-16929 


Exon : 


16930-16940 


Intron: 


16941-17632 


Exon : 


17633-17712 


Intron: 


17713-18101 


Exon: 


18102-18321 


Intron: 


18322-18774 


Exon : 


18775-19051 


Stop: 


19052 



CHROMOSOME MAP POSITION: 

Chromosome 4 
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ALLELIC VARIANTS (SNPs) : 

DNA 



Position 


Major 


Minor 


Domain 






1735 


A 


G 




Intron 






1922 


A 


G 




Intron 






2361 


C 


T 




Intron 






7371 


G 


C 




Intron 






9558 


G 


A 




Intron 






10579 


T 


G 


A 


Intron 






10625 


C 


T 




Intron 






11147 


A 


G 




Intron 






15131 


C 


T 


G 


Intron 






15221 


A 


G 


T 


Intron 






15778 


T 


C 




Intron 






15895 




A 




Intron 






19786 




T 




Beyond 


ORF{3' 


') 


20157 


G 


A 




Beyond 


0RF(3" 


') 


20246 


T 


C 




Beyond 


ORF{3' 


') 


20681 


C 


A 




Beyond 


0RF(3" 


') 


20819 


T 


C 




Beyond 


ORF(3' 


') 



Context : 



DNA 

Position 

17 35 TACTTT ACTGATAT ACACCT ATTTGACTTATTTTTATTTCTTTGCTTTACTGATAAAAAG 

TTGTTTTGCTTTGCAATTTTCATATAGTTGTGATCAGAGCTGGTCAATGCAAGACATGTT 
TTTATCCAAATATGTTTGAGAATTATGTAGAAACATGAAAAAAGGTACAATTATATCCGA 
CACTAAAATATTGTTTAATGTATTCCAACGAATTCTTATGCATAGACTGTTTCACAGAAC 
TAATATTCAGAGGATCCCAGTTCAAATGTCCTTAGCCTTAGACATGATTTGAATTTACAT 
[A,G] 

TATTGATTTGCTTTAAATAATTTTCCATTCAGTAAGCTGTGCCTAGCTGCAGATAGCCTA 
CCAGGCTTTATGGATCTAGGTAAACAATACAAATCTCTTGGCCTCAAGTCTACATTCAGA 
TATTAATTTAAAGGGGTACAGCTATATAGAGGTCACTGGCAAATTTTGGTAAAATAGGAT 
TATAGTAAAAGCCCCCTGACAAGATTGAAATTTAAAATAAAACAAAAGTGTTATCAAAGG 
GGTGAAAGAGCATTTTCCAATAAACAAAAGTGGGTTCTGGCCATGCATTCAGAAATTCCC 

1922 ATATTGTTTAATGTATTCC7VACGAATTCTTATGCATAGACTGTTTCACAGAACTAATATT 
CAGAGGATCCCAGTTCAAATGTCCTTAGCCTTAGACATGATTTGAATTTACATATATTGA 
TTTGCTTTAAATAATTTTCCATTCAGTAAGCTGTGCCTAGCTGCAGATAGCCTACCAGGC 
TTTATGGATCTAGGTAAACAATACAAATCTCTTGGCCTCAAGTCTACATTCAGATATTAA 
TTTAAAGGGGTACAGCTATATAGAGGTCACTGGCAAATTTTGGTAAAATAGGATTATAGT 
[A, G] 

AAAGCCCCCTGACAAGATTGAAATTTAAAATAAAACAAAAGTGTTATCAAAGGGGTGAAA 
GAGCATTTTCCAATAAACAAAAGTGGGTTCTGGCCATGCATTCAGAAATTCCCCAACAAT 
TCTTTAAAAATCATGGAGCAGCTTGATATATAAGAAATTCATTTAATAACTATATTTATT 
ATGTAGCTCCAACTTACTAAATTATTGATTATTATATATTTTATAGAATTATCTATTGTG 
AGTCTAAATCAAGAGTATATATTCAAACAACTATAGGAAAAGGGATATCAGTCAATTTCA 

2361 CAGCTTGATATATAAGAAATTCATTTAATAACTATATTTATTATGTAGCTCCAACTTACT 
AAATTATTGATTATTATATATTTTATAGAATTATCTATTGTGAGTCTAAATCAAGAGTAT 
ATATTCAAACAACTATAGGAAAAGGGATATCAGTCAATTTCAATTCAAGGATTTATTTCC 
ATAAGTGCTTACGCACAGGTGTATTTCATTTTATTATACATTGCTTTATTGTCCTTCACA 
AAAATTGCAATTTACAAATTAAAGGTTTTTGAAAACCTTGAATCAAGCTAATCAATTTGG 
[C,T] 

GTAATATTTCCAACAACAAGTGTGTACTTTTGACTCTATCACATATTGGCATTTATCATG 
CTTTTTCAAATTTTTCATTGTTATATCTGTTACGGTGATCTGGGATCAGTGTTCCTTGAT 
GGTTACACGTTTATTAGCTTGGGGGCACCTTGATGTGTTACAATATAAGACAGCAAACTT 
AATTATAAATGTTGTGCATGTACTAACTGCTCCGCTGATTCGTTTCCCCATCCCACTTCT 
TCTTAGGCCTCCCTATTCCCTGAGACACAGTAATATAACATACAATGACTTCTAAATGTT 
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w 
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& 
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AAATAAGTGATGAAGGAAGACAAAGAATAAAAGAAAATTTCAATAAATGGAAAGTTTAAG 
TGTTTAATGATAGTGATGACTTTTACTCAAATAAGTGCTTAGAAGTCATCTTGTTTGTGA 
TTTATATGATGAATTCTGTGTTGTGACTATCCACTTTGAGCTCGTGAGAATGTTAGGTGA 
GGTTTAATAAAAGCCATTTGAGAAAAACAAGGTTTCAACCTCTGTGGACAGAAATCTAAA 
TATCGATAGTTATCAGGACAAAGTAGAGCTCATAGAAATAATTTTGCAGCCTGCAGGTTT 
[G,C] 

TTTTGGAGTGAAAATAAAATTGTATACTATATTCCTAAATCATCAGAGGAAAAAATTTAT 
AGTTCAAGGAATGTTGAAAGAAACAATATTGAGAAGTAAAAGTGAGTAATAGTTGTTATA 
GTTTTTTAATAGTTTTGTAAGTATGTCTTGAGTTCACTGTCCCAAAAGTGGCTATTAGCT 
CTAGCCTTGACCTGACAAGGTTCTAGGATATTTAGTCATGGATGTTCATAATCTACCTCT 
TACGGGATACTTTTTATTCTGATGAACAGCCTAATGCCTAAGTGTGCAATCTATACCAAG 

TCCACATATAAGGGATACCATGCTATCTCTGCCTGGATTATTTCAGTTAACATTATGCCC 
TGGAGGTTCATTCATGTTTCTACAAATGACAGGATTTCATTCTTTTTTTTCCAATATATA 
TTTAATGAAATGGATATATATAAACATTGGAAAATGTATATATATATATATATCTCCAGT 
GGAATGCTATTGAGCTATAAAAAAGTTAATATATAATAGAAATAAAGCTTATATATATCT 
AATGGAATGGATATATATATATAATGGAATAGAAATATATATCTATACATATAAACACAC 
[G,A] 

C AAT AT AC AT AT C CAT T T CAT T G CAT AT AT AT AT AT AT AG AG AG AG AGAGAGAG AG AT AT 
TTTCAAATGTGTGTATATATATCCAATGGAATGGACATATATATATGTATATTTTTTCCA 
TATTTTCTTTATGTATTTCTTCATTAATGGATGTTTAGGTTGATTCATCCCTTGGGTATA 
TGAATAATGTTGATGTAAACATAGAAGGACAGATATCTCTATGACTTCTTAGTTTATTTA 
AATATACACCCAGTAATGGAAATGCTGTATAATATGGTAGTTCTATTTTCATTTTTTGAG 

CAATGCTTTGATATAACCCATTTATCTACTTTTCCTTTTGTTGGCTGTGCTTCTGAAGTC 
CTATCCAAAAAAATCCTTGCCTAGACCAATGTCACAAATCATTCCTCCTACAGTTTCTTC 
TAGTAGTTGTATAATGTTTGGCCTTATATTTAACTTTGTAATTCATTTTTACTTACTTTG 
TATATGGTGAGGGATAGAGGTCTAGTTTCATTTTCTGCATGTGGATATGCAGTTTTCCTA 
GCACCATTTAGTGAAGAGGTTGCCTTTTTTCTATTATGTGTTCTTGGCACCTTTGTCAAA 
[T,G,A] 

GTCAGTTAGCTGCTATATTCCTCCATTTGTGTTGTTATAGAGGAACACATGAGACTAGCA 
AATTTATATATCAAATAGAATTATTTGAATGATAGTTCTGCATACTGTACAAGAAGCACA 
GCACTGACTTCTGCTTGGCCTCTGGTAAGGTTCTCAAGATGCTTCCACTTGTGGTAGAAG 
GCAAACATGAGCTGGTATATGCAAAGGTCTCATGACAAGAGAGGAAACCATAAAGAGGGG 
ATGTGAGGGAGTGCCAGGTTTTGTAAAACAACTAGCTCTTCTGGGAACTAATAGAGTAAA 

GTGCTTCTGAAGTCCTATCCAAAAAAATCCTTGCCTAGACCAATGTCACAAATCATTCCT 
CCTACAGTTTCTTCTAGTAGTTGTATAATGTTTGGCCTTATATTTAACTTTGTAATTCAT 
TTTTACTTACTTTGTATATGGTGAGGGATAGAGGTCTAGTTTCATTTTCTGCATGTGGAT 
ATGCAGTTTTCCTAGCACCATTTAGTGAAGAGGTTGCCTTTTTTCTATTATGTGTTCTTG 
GCACCTTTGTCAAAAGTCAGTTAGCTGCTATATTCCTCCATTTGTGTTGTTATAGAGGAA 
[C,T] 

ACATGAGACTAGCAAATTTATATATCAAATAGAATTATTTGAATGATAGTTCTGCATACT 
GTACAAGAAGCACAGCACTGACTTCTGCTTGGCCTCTGGTAAGGTTCTCAAGATGCTTCC 
ACTTGTGGTAGAAGGCAAACATGAGCTGGTATATGCAAAGGTCTCATGACAAGAGAGGAA 
ACCATAAAGAGGGGATGTGAGGGAGTGCCAGGTTTTGTAAAACAACTAGCTCTTCTGGGA 
ACTAATAGAGTAAAAATTCGCCTCCCAGGCAGGGGATTAATCTATTCATGAGGGATCTGC 

ACAACTAGCTCTTCTGGGAACTAATAGAGTAAAAATTCGCCTCCCAGGCAGGGGATTAAT 
CTATTCATGAGGGATCTGCTTCCATGACAAAGGCACATTCTGTTAGATTCTACCCCCAAT 
ATTGGGGATCAAATTTTAACATGAAGTGTGGAGGGCTCAAATATCCATACTATGGCAGCA 
GTAAATGCATAAATTTATTTTGTGGATCTCTATTCTATATAGTATTGGTGTATGTATCTG 
TTTTCATGCCACTGCCATACTGTTTTGGTGATGATATCTATGCTATATATGTGTGTGTGT 
[A,G] 

TATATATATTATATATATGTATATATGTGTATATTATATATATGTATATATGTGTATATT 
ATATATATATAATACTTTAAGTTTTATATATATATAAAATACTTTAAGTTCAAGGGTACA 
TGTGCAGGATGTGCAGGTCAGTTACATAGGTATACATGTGCCATTTTGGTTTGCTGCATG 
CATCAACTCATCATTACATTAGGTATTTCTCCTAATGCTATCCCTCCACCAGCCACCCAA 
CCCCCAACAGGCCAGGTGTGTGATGTTCCCCGCCCTGTGTCCATGTGTTCTCATTGTTCA 

CAGGGAAGTTTTCTGTGACTGATGCTATTTTGGTTTGGAGAATGGGGTGATGTGGCACAA 
TGATCTTTCTTCTTTCTGGTCATGGATTTTTTAATTTCCATGAACCCATAAGATTTTTCA 
CTTTTCTTCTGAGCTCTGGTGCTTTCAGAGTGGTATTTTTATATTCGAATAGTTGCTAGT 
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TGTACTTTTAAAAGCGATTGATGCTGGAGGTCTTCTATTCCACCATCTCGCTGATGTCAG 
TCCTCAAATAATAATTTTATATTTTAGCAAATTATTTTGGTTTTAGGATTTTGTGTCTAC 
[C,T,G] 

TGACACAGACATGAAAAGAGATGTACTCATTACTGAAACTTTTTGCATACTGTTTTGGTT 
GTGCGCCTTTTCTAGTATGAATGATTACATATTTAAGCCACATGTTTTATACATAGACTG 
TCCTTTAAAGAGACTAGATAGTTCTGTGTGTCAGCATATAGGGACAGAATATAACTACAC 
ATTAATAATTTCTCAAGTATTTATTTTAGAAGTGTAAGTAACCTTTATTTTAATTTTTGT 
TATATTATGCCTCTGTAATGCAGATAAATTTTTATCTTCAGGAAATGGAAAATTTTGTCC 



15221 TTAATTTCCATGAACCCATAAGATTTTTCACTTTTCTTCTGAGCTCTGGTGCTTTCAGAG 
TGGTATTTTTATATTCGAATAGTTGCTAGTTGTACTTTTAAAAGCGATTGATGCTGGAGG 
TCTTCTATTCCACCATCTCGCTGATGTCAGTCCTCAAATAATAATTTTATATTTTAGCAA 
ATTATTTTGGTTTTAGGATTTTGTGTCTACGTGACACAGACATGAAAAGAGATGTACTCA 
TTACTGAAACTTTTTGCATACTGTTTTGGTTGTGCGCCTTTTCTAGTATGAATGATTACA 
[A,G,T] 

ATTTAAGCCACATGTTTTATACATAGACTGTCCTTTAAAGAGACTAGATAGTTCTGTGTG 
TCAGCATATAGGGACAGAATATAACTACACATTAATAATTTCTCAAGTATTTATTTTAGA 
AGTGTAAGTAACCTTTATTTTAATTTTTGTTATATTATGCCTCTGTAATGCAGATAAATT 
TTTATCTTCAGGAAATGGAAAATTTTGTCCAGAGTTCAGGGGAAGATGGTATTGTGGTGT 
TTTCTCTGGGGTCACTGTTTCAAAATGTTACAGAAGAAAAGGCTAATATCATTGCTTCAG 

15778 GTTTCAAAATGTTACAGAAGAAAAGGCTAATATCATTGCTTCAGCCCTTGCCCAGATCCC 
ACAGAAGGTCAGTAAAACCTCCAATCCTGATAAGCAGCTATTCACATAATGAAACAGTAT 
GGTTTTATTTGGGTCTTGAATCTCATTTTCCACTTAGCATAACAGGTACCAAAATTTGCA 
O AAACATTATAGTAGTGTACATGGGCATAACTGATCATTTGCCTACTGAGTCTTGCTGTTA 
CTGGAAACAACTTTCTTGATTGTCATTTGTTTATAATAAAATAGATATAATAAATAAAGC 

tT,c] 

CTACCTTATATTTTAGGATTTGAAATCTAAAAGCGTGTGCCAATGATTCCAAAAAAAAAT 
Oil T CT GAC AT CT AT TAT T T C AAAGGAC C AG AAAAAGG AAAAC TGAT AT AAAAAAAAAAAG AA 

42 GAATCAATCTCAAGAATATCTTCTCATATTTGTGTGTATAAAAACTGTATTCAGGGTAGT 
ijv] TTTGCTTAGAAATAAAAGCTCAGATTAATGTAGTCTTTCTAAATAATTAGAAGTTTCAAA 
"% AGTAAAATGTCAATTACAATTATAGTATAGTAACAATTATTTAAGTAATGTAATTATTTA 

158 95 TATGGTTTTATTTGGGTCTTGAATCTCATTTTCCACTTAGCATAACAGGTACCAAAATTT 
a GCAAAACATTATAGTAGTGTACATGGGCATAACTGATCATTTGCCTACTGAGTCTTGCTG 
Q TTACTGGAAACAACTTTCTTGATTGTCATTTGTTTATAATAAAATAGATATAATAAATAA 
f^i AGCTCTACCTTATATTTTAGGATTTGAAATCTAAAAGCGTGTGCCAATGATTCCAAAAAA 
I T AAATTCTGACATCTATTATTTCAAAGGACCAGAAAAAGGAAAACTGATATAAAAAAAAAA 
** [-,A] 

m GAAGAATCAATCTCAAGAATATCTTCTCATATTTGTGTGTATAAAAACTGTATTCAGGGT 
p AGTTTTGCTTAGAAATAAAAGCTCAGATTAATGTAGTCTTTCTAAATAATTAGAAGTTTC 
s^l AAAAGTAAAATGTCAATTACAATTATAGTATAGTAACAATTATTTAAGTAATGTAATTAT 
TTATGATACTCCACTAATTTTAACTTTATTATTACTGTAATTCTAGAATTTCACACTTTA 
GATAGTGCTATATATAAACTATCCAAAAGATATTTCATTTTATATTTAGCTAAAATACTT 

19786 GAAACATAATGATGTTTCTTTTTCAATTTAAATAAGCCCTTCTACATAGCCAGCATCAGT 
GATCTCAGAAAATAAATTGCTAATAATGATGACATGGCATTATGCTTAGAAAAGTTTGCT 
GTATTTCCATAGACCTCATCTAGATGTCATGGCCTACATTTCTGCCATCACTCAACCAAT 
ACTTTTTTCTGTTTTCTTGATGATAAAAAGACCTTTCTCATGATTGCCATCAAATAACAA 
AAGAAACTATTTTTTTTCTCACATAGAGAACATGTCAGTAAGATATTCAAGGTGAACAGA 
[~,T] 

ATTTTTGGGATTAGTAACTATTTGAAATATGTGGTGATAATTACTGAGTTTATAAAATTT 
ATTTGATAGTACACTTAAAGAAGATTTATATGTTTATTCTTTAAAAATGATGAATACTCA 
TAATTCTTATCTCTATAATCAAAAGTATAATTTACTGTAGAAAAATAAAGAGATGCTTGT 
TCTGAAAGTAAGATCAGTGAACTGCTTTTCAGTCTCAATCTTTGAGAATTGTAAATTCAT 
CAAATAATTGCTTACATAGTAAAAATTTAAGGTATTAGAAAACCTGCATAACAAATAGTA 

20157 ACACTTAAAGAAGATTTATATGTTTATTCTTTAAAAATGATGAATACTCATAATTCTTAT 
CTCTATAATCAAAAGTATAATTTACTGTAGAAAAATAAAGAGATGCTTGTTCTGAAAGTA 
AGATCAGTGAACTGCTTTTCAGTCTCAATCTTTGAGAATTGTAAATTCATCAAATAATTG 
CTTACATAGTAAAAATTTAAGGTATTAGAAAACCTGCATAACAAATAGTATTATATATTA 
AATATTTTGATATGTAAAGCTCTACACAAAGCTAAATATAGTGTAATAATGTTTACACTA 
[G,A] 
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TAAGCAAATATGTTAATCTTCTCATTTTTTTACTGTCATATAATCTTAGTGATATGCCTA 
TTAATAGTTTTAAATAAATAAATTGGCTTATCTGGCTTTTTG7\AAATTTTGAAATTCTTA 
CAGATGTTGATTAGGTATATCTACAAATTAATTTCAATTTTAAAATGATGATATAAAAAT 
AAATATAAGTATTTTTCTTGTGTATGTATACAATAAATATAAATAAAATTGTTTACTGTT 
TTGAAAGTTTCTTAAGTTTTTACACTGATATGTTTTTTGACTTTTACAATATTATTATAA 

GAAAAATAAAGAGATGCTTGTTCTGAAAGTAAGATCAGTGAACTGCTTTTCAGTCTCAAT 
CTTTGAGAATTGTAAATTCATCAAATAATTGCTTACATAGTAAAAATTTAAGGTATTAGA 
AAACCTGCATAACAAATAGTATTATATATTAAATATTTTGATATGTAAAGCTCTACACAA 
AGCTAAATATAGTGTAATAATGTTTACACTAGTAAGCAAATATGTTAATCTTCTCATTTT 
TTTACTGTCATATAATCTTAGTGATATGCCTATTAATAGTTTTAAATAAATAAATTGGCT 
[T,C] 

ATCTGGCTTTTTGAAAATTTTGAAATTCTTACAGATGTTGATTAGGTATATCTACAAATT 
AATTTCAATTTTAAAATGATGATATAAAAATAAATATAAGTATTTTTCTTGTGTATGTAT 
ACAATAAATATAAATAAAATTGTTTACTGTTTTGAAAGTTTCTTAAGTTTTTACACTGAT 
ATGTTTTTTGACTTTTACAATATTATTATAATCTAGGAAAAGCTGATTATATCTGTTTTA 
AGCCTCATCTTTTCTCTGTAATTAAACACAGTAATTTATTAACATGCTGTGACAGGTGGG 

TAAAATTGTTTACTGTTTTGAAAGTTTCTTAAGTTTTTACACTGATATGTTTTTTGACTT 
TTACAATATTATTATAATCTAGGAAAAGCTGATTATATCTGTTTTAAGCCTCATCTTTTC 
TCTGTAATTAAACACAGTAATTTATTAACATGCTGTGACAGGTGGGAAGCCATTTCTGGA 
GTTGAGCCTGCTGACACTCTGGAGCTTTTTAGGTTGGACGTTCATTGTATGTGGGACTCT 
CTGCCTCTCGATAGCTGTTGCTCATAAGACTCTCCTTCATCAATCTGGCATTGAATTTTG 
[C,A] 

GATCAGTTGCAATCAGAATCCAATTGGCCTTGCCGTTTTAGTATGTTCTATCTTAACCAG 
CAATTTCTAACCAGGAGCCTGCCCAGGTTTGTTCTGTCTTCCCTGTAAGAAGCTCCCAGC 
ATAAATATTCTAAATTTTACACTACTAATCTATTAACCAACCTTTGGACCATGTTCACTT 
TAGGTTGAGCATAGTGTGATGAGATGCAAATTAAATTACAATCCTATAGGTGTGTGTTAT 
AAATTTTAAAGTGTATAAATTAAATAACACATTCTAAGTATCCAACAAAGGTCAAAAAAA 

AATTTATTAACATGCTGTGACAGGTGGGAAGCCATTTCTGGAGTTGAGCCTGCTGACACT 
CTGGAGCTTTTTAGGTTGGACGTTCATTGTATGTGGGACTCTCTGCCTCTCGATAGCTGT 
TGCTCATAAGACTCTCCTTCATCAATCTGGCATTGAATTTTGCGATCAGTTGCAATCAGA 
ATCCAATTGGCCTTGCCGTTTTAGTATGTTCTATCTTAACCAGCAATTTCTAACCAGGAG 
CCTGCCCAGGTTTGTTCTGTCTTCCCTGTAAGAAGCTCCCAGCATAAATATTCTAAATTT 
[T,C] 

ACACTACTAATCTATTAACCAACCTTTGGACCATGTTCACTTTAGGTTGAGCATAGTGTG 
ATGAGATGCAAATTAAATTACAATCCTATAGGTGTGTGTTATAAATTTTAAAGTGTATAA 
ATTAAATAACACATTCTAAGTATCCAACAAAGGTCAAAAAAATGATATAAAGTCACCAAA 
C 
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